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Abstract 

We establish several sum-product estimates over finite fields that involve polyno- 
mials and rational functions. 

First, \f(A) + f(A) \ + \AA\ is substantially larger than \A\ for an arbitrary poly- 
nomial / over F p . Second, a characterization is given for the rational functions / and 
g for which \f{A) + f(A)\ + \g(A,A)\ can be as small as \A\, for large \A\. Third, 
we show that under mild conditions on /, |/(^4,A)| is substantially larger than \A\, 
provided \A\ is large. 

We also present a conjecture on what the general sum-product result should be. 

1 Introduction and statement of the results 

Sum-product estimates For a polynomial f(x\, . . . ,Xk) and sets A\, . . . , A^ define 
f(A\, . . . , Ak) = {/(oi, . . . , Ofe) : en G Ai}. As mostly we will be dealing with the case 
A\ = ■ ■ ■ = Ak, we write f(A) = f(A, . . . , A) for brevity. We also employ the notation 
A + B = {a + b : a G A, b G B} and AB = {ab : a G A, b G B}. The notation 
X <C Y means X < CY for some effective absolute constant C, whereas X <Cr,s,... Y 
means X < C(r,s,...)Y for some function C. 

The sum-product theorem of Erdos and Szemercdi [ES83] states that if A is a finite 
set of real numbers, then either A + A or A A has at least |A| 1+C elements where c > 
is an absolute constant. After several improvements, the current record, due to Solymosi 
[S0IO8], is that the result holds with c = 1/3 — o(l), and it is conjectured that c = 1 — o(l) 
is admissible. 

In practice, most applicable are the sum-product estimates when A is in a finite field 
or a finite ring. Not only they have been used to tackle a wide range of problems (see 
|Bou09j for a survey), but they are also more general, for as was shown in [VWW08] the 
uniform sum-product estimates in F p imply the sum-product estimates over the complex 
numbers. The first estimate in ¥ p was proved by Bourgain, Katz and Tao |BKT04| for 
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\A\ > p 5 for arbitrarily small, but fixed 5 > 0. The restriction was subsequently removed 
by Bourgain and Konyagin [BK03 . There was a rapid series of improvements, with the 
best known bounds for A C F p being 



\A + A\ + \AA\ > < 



Vl 12/11 "° (1) , if \A\ < P x J 2 ,{see [RudlOp . 

|A| 13 / 12 (|A|/ > /p) 1 /12-o(l) j if pl/2 < | A | < p 35/68 (gee PQ9] ), 

lAKp/IADVll-oW, if p 35/68 < |^| < p 13/24 ( gee pQgj) > (1) 

|A| • l^l/^/p, if p 13 / 24 < \A\ < p 2 / 3 (see (Gar08bj ). 

^\A\(p/\A\)V 2 , if |A| > p 2 / 3 (see |Gar08bj ). 



Of these results, Garaev's estimate of \A\ (p/|j4| ) 1 / 2 for \A\ > p 2 ! 3 is notable in that it is 
the only sharp bound. It is likely that + + > min(|A| (p/|,4|) 1/2 " o(1) , | A\ 2 -°^). 

Of use are also the statements that one of A + A or f(A) is substantially larger than A, 
where / is a rational function, that is possibly different from f{x, y) = xy. For example, 
Bourgain |Bou05| showed that either A + A or 1/A + l/A is always large, and used this 
estimate to give new bounds on certain bilinear Kloosterman sums. In application to a 
construction of extractors, in the same paper Bourgain asked for sum-product estimates for 
f(x\, . . . , Xk) = x\ + ■ ■ ■ + x\. The most general result of the kind is due to Vu |Vu08] . who 
generalized an earlier argument of Hart, Iosevich and Solvmosi [HIS07| . Call a polynomial 
f(x,y) degenerate, if it is a function of a linear form in x and y. In |Vu08j it was shown 
that if / is a bivariate non-degenerate polynomial of degree d, then 

\A + A\ + \f(A)\ » \\ A \{\A\/d 2 ^>yi 2 d-\ if P V2 < w < 

' 11 h \|A|(p/|A|) 1 /3 d -l/3 ) if \ A \ > d V5 p 7/10_ ^ 

The same argument was used in [HLS09j to establish a version of (|2|) for + + |/(A)|. 

As nearly all applications of sum-product estimates in finite fields have taken advantage 
of validity of the estimates for A of very small size, it is of interest to extend this result 
to \ A\ < y/p. The result can also be improved qualitatively because for some polynomials 
/ it is true that f(A) is much larger than A no matter how large or small A + A is. 
So, for example, Bourgain |Bou05| showed that x 2 + xy and x(y + a) for a / are such 
polynomials (Bourgain actually showed that x 2 + xy grows even if x and y range over 
different sets; see also |HH 09j for a generalization). 

The sum-product estimate results are connected to the problem of giving good upper 
bounds on the number points in a Cartesian product set, such as A x A x A, that lie on 
a given variety. For example, the estimate ([2]) is related to a bound on the the number 
of points on a surface, in the case A has small additive doubling (see Lemma [20] for the 
explicit form). Some results in this direction for the special case where A is an interval 
have been obtained by Fujiwara [Fuj88| and Schmidt |Sch86j . The proof of theorem [6] 
below and conjecture at the end of the paper give additional links. 
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The goal of this paper is to communicate new sum-product estimates for polynomial 
and rational functions. Our results are of two kinds: The first kind are valid even for small 
sets (of size \A\ > p e for every e > 0). The second kind extend Vu's characterization to a 
more general setting, but are valid only for large sets (\A\ > p c for a fixed constant < 
c < 1). We thus expect the estimates for the small sets to be more useful in applications, 
whereas the large set results illuminate the general picture. 

Small sets. The results in this section are stated only for sets of size \A\ < ^fp. Modifi- 
cation for large \A\ involve no alterations in the fabric of the proofs, but would introduce 
much clutter. Moreover, for large \A\, the large-set results are not only more general, 
but yield sharper quantitative estimates. We did not optimize the numeric constants that 
appear in the bounds below because the results are very unlikely to be sharp for any value 
of the constants. 

Theorem 1. Let f £ F p pT] be a polynomial of degree d > 2. Then for every set A c¥ p 
of size | A\ < tJp we have 

\A + A\ + \f(A) + f(A)\ > \A\ 1+ T^. 

Note that by Ruzsa's triangle inequality (Lemma [S] below) this implies that for \A\ < 
y/p and any polynomial g of the form g(x,y) = x + f(y) with deg/ = d > 2 we have 
\g(A)\ » | J 4| 1+ 32-e d , which is a generalization of |HLS091 Theorem 3.1]. 

The next result is an extension of ([2]) to sets of any size for polynomials of degree two. 

Theorem 2. There exists an absolute constant c > such that whenever f S F p [X, Y] is 
a bivariate quadratic polynomial that is not of the form f = g{ax + by) for some univariate 
polynomial g, then for every A C ¥ p of size \A\ < y/p we have 

\A + A\ + \f(A)\^>\A\ 1+c . 

Our final small-set result is another generalization of the sum-product theorem itself: 

Theorem 3. Suppose f = Ylt=i a i xd% € ¥ P [X] is a polynomial with k terms, and an 
integer d > 2 satisfies di < d for all i = 1, . . . , k. Then for every positive integer r, and 
every set A C ¥ p of size p A l r d mr < \A\ < y/p we have 

\AA\ + \f(A) + f(A)\^\A\ 1+ c, 

where 

£ = (5000(r + k) 2 log 2 dy k . 

The main appeal of this estimate is that the dependence on the degree of / is merely 
logarithmic, which suggests that the exponents in all the sum-product estimates should 
not depend on the degree. Further evidence that the exponents in sum-product results 
should not depend on the degree is provided by the sum-product estimates for large subsets 
A c¥ q , which we present now. 
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Large sets. For a polynomial / £ ¥ q [Xi, . . . , X n ] and sets Ai,...,A n C F p write 
iV(/; Ai, . . . , A n ) for the number of solution to f(x\, . . . , x n ) = in Xj £ Ai. The com- 
monly used case N(f; A, . . . , A) will be abbreviated as N(f; A). More generally, if V is a 
variety in AJJ , then iV(V; Ai, . . . , A n ) is the number of points of V on A\ x • • • x A n . The 
principal result that generalizes ([2]) is 

Theorem 4. Ze£ f(x,y,z) be an irreducible polynomial of degree d which is not of the 
form P(ax + by, z) or P(x, y). Moreover, let A,Bc F q . Assume d<q^ 40 . Then 

\B\\A\ 4 i\A\{\A\/^q)^d-\ ifqV* < \A\ < d^q 7 / 10 , 

1 1 N(f-A,A,B) 2 [lAKq/lAD^d- 1 , if\A\>d^q 7/W - 

In particular, the inequality (|2j) holds (with slightly worse dependence on d) as witnessed 
by setting f(x, y, z) = g(x, y) — z, B = g(A) and noting that N(f; A, A, B) = \A\ 2 . 

The condition that / be irreducible is purely for convenience, since max(iV(/i), N(f2)) < 
N{fif2) < N(fi) + N{f2) holds for reducible polynomials. On the other hand, the condi- 
tion that / is not of the form P(ax + by, z) is essential because if / is of this form, then for 
A = {1, . . . ,n} and an appropriate B the result fails. Though the theorem is formulated 
only for polynomials, the questions about growth of rational function can be reduced to 
it, of which the following result is an example. 

Theorem 5. Let f{x) £ ¥ q (x), g{x,y) £ F q (x,y) be non-constant rational functions of 
degree at most d, and assume g(x,y) is not of the form G(af(x) + bf(y) + c),G(x), or 
G(y) with a,b,c £¥ q . Then if \ A\ > q? and d < g 1 / 50 , we have the estimate 



\f(A)+f(A)\ + \g(A,A)\ » 



f \A\(\A\/^d- 2 , ifq^ < \A\ < d*/W /10 , 
\|A|(g/|A|) 1 /3d-2 ) if\ A \ > d 8/5 g 7/l 0> 



The main feature of all the results above is the abelian group structure inherent in 
A + A and A A. That structure permits us to use sumset inequalities from the additive 
combinatorics, as well as the Fourier transform. The following result shows that for most 
polynomials /, the set f(A) always grows even in the absence of any group structure. 

Definition. Let f(x,y) £ K[sc, y] be a polynomial of degree d in the x-variable. Call 
f{x,y) monic in the x variable if the coefficient of the x d term is a non-zero constant of 
K. That is, 

f{x,y) = cx d + g(x,y), 
where c £ K \ {0}, and g(x, y) is of degree < d — 1 in the x-variable. 

Theorem 6. Let f(x,y) £ F q [x, y] be a polynomial of degree d which is non-composite, 
and is not of the form g(x) + h(y) or g(x)h(y). Suppose also that f(x, y) is monic in each 
variable. Then if \A\, \B\ > q 7 / 8 , 

\f(A,B)\ » d min(^/pp|,l^| 3/4 |S| 3/ V 7/16 ). 
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This result is to be compared with the estimate of Elekes and Ronyai over the real 
numbers: 

Theorem 7 ([EROOJ, Theorem 2). Let f(x,y) £ K(x,y) be a rational function of degree d 
which is not of the form G(g(x) + h(y)), G(g(x)h(y)) or G( p^^l ). Then there exists 
a constant c = c(d) > such that whenever \A\ = \B\ = n, 

|/(A,J3)|> d n 1+c . 

The statement appearing in [EROOj is quantitatively weaker, but the bound of n 1+c 
follows from the proof. Note that the case fz^gw^ arises because K is not algebraically 
closed. Indeed, £±£ = G{F{x)F{y)) where G{x) = Jfcj^ and F(x) = i±g. 

The rest of the paper is organized as follows. In sections [2] and [3] we gather analytic 
and algebraic tools used in the paper. Theorems HJ El an d [3] on small-set estimates are 
proved in section^ All the large-set results, apart from Theorem[6]on f(A, B), are proved 
in section El whereas Section [6] is devoted to Theorem [H It is followed by the proofs of 
the algebraic lemmas used throughout the paper. The paper ends with several remarks 
and a conjecture. 

2 Analytic tools 

We shall need a number of tools from additive combinatorics that we collect here. 

Lemma 8 (Ruzsa's triangle inequalities, |Ruz09| . Theorems 1.8.1 and 1.8.7). For every 
abelian group G and every triple of sets A,B,C C G we have 

\A±C\\B\ < \A±B\\B±C\, 

where the result is valid for all eight possible choices of the signs. 

Let s*A = A + A + -- -+ A where A appears s times as a summand. 

Lemma 9 (Pliinnecke's inequality, |Ruz09j . Theorem 1.1.1). For every abelian group G 
and every A C G we have 

\s * A - t * A\/\A\ < (\A ± A\/\A\) s+t , 
where the result is valid for either choice of the sign. 

Let A • A = {\a : a G A} be the A-dilate of A. The following result of the first author 
is used in the proof of Theorem [3] to obtain the logarithmic dependence on the degree. 

Lemma 10 ( |Buk08| . Theorem 3). If F is an abelian group and A C F is a finite set 
satisfying \A + A\ < K\A\ or \A - A\ < K\A\, then |Ai ■ A H h X k ■ A\ < K P \A\, where 

k 

P = 7 + 12j]bg 2 (l + |A i |). 

i=l 



5 



Lemma 11 (Szemeredi- Trotter theorem for ¥ p , |BKT04| ) . Let V and C be families of 
points and lines in of cardinality \V\, \C\ < N < p 2 ~ a with a > 0. Then we have 

\{(p,l) GPx£:pel}|«iV 3/2 - E 

for some e = e(a) > that depends only on a. 

For sets A, B in an abelian group, and a bipartite graph G C Ax B we put A +g B = 
{a + b : (a, b) G G} to denote their sumset along G. Similarly, A-q B = {ab : (a, b) S G} 
will denote their productset along G. 

Lemma 12 (Balog-Szemeredi-Gowers theorem, Lemma 4.1 in |SSV05| ). Let T be an 

abelian group, and A, B C T be two n-element sets. Suppose G C A x B is a bipartite 
graph with n 2 / K edges, and \A +g B\ < Cn. Then one can find subsets A' C A and 
B' C B such that \A'\ > n/(lQK 2 ), \B'\ > n/(AK), and \A' + B'\ < 2 12 C 3 K 5 n. 

In the proofs of Theorems S] and [5] we will use Fourier transform on . For that 
we endow F^ with probability measure, and its dual with the counting measure. Thus, 
the Fourier transform is defined by /(<!;) = — f(x) exp(2-7rzx • £), Plancherel's theorem 
asserts that 

the convolutions are defined by (/ * g){y) = J2 X f(x)g(y — x), and satisfy f * g = fg- 

3 Algebraic tools 

Here we record several results in algebraic geometry that are repeatedly used throughout 
the paper. 

Throughout the paper AJ| denotes the n-dimensional affine space over K, the algebraic 
closure of K. When the field is clear from the context, we write simply A n . For a 
reducible variety V C P N , define the total degree deg(V) to be the sum of the degrees of 
all irreducible components of V. 

Lemma 13 (Generalized Bezout's theorem, |Ful98j . p. 223, Example 12.3.1). Let V\ and 

V2 be two varieties in F N and let W be their intersection. Then 

deg(W)<deg(V 1 )deg(V 2 ). 

Much of this paper is about proving non-trivial upper bounds on the number of points 
on varieties in Cartesian products. Both for comparison, and because we need it several 
times in the proofs, we give an explicit "trivial bound". Recall that N(V; A\, . . . , A n ) 
stands for the number of points of V on A\ x • • • x A n . The following result is a general- 
ization of the well-known Schwartz-Zippel lemma to varieties. 
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Lemma 14. Let V be an m- dimensional variety of degree d in A n . Let A±, . . . ,A n C A 1 
be finite sets of the same size. Then 

N(V;A l ,...,A n )<d\A 1 \ m . 

Proof. The proof is by induction on m, the case m = being trivial. If V is reducible, 
then its degree is the sum of the degrees of its components. Thus, we can assume that V 
is irreducible. If d = 1, then V is a hyperplane, and the lemma is immediate. If d > 2, 
then the irreducibility of V implies that for each a £ A\ the hyperplane H a = {x\ = a} 
intersects V in a variety of dimension m — 1. Indeed, if dimF n H a = m, then V H a 
is a component of H a contradicting irreducibility. By Bezout's theorem (Lemma 1 13j) the 
degree of V n H a is at most d, which by induction implies that 

N(V; A u ...,A n )=J2 N (. v n H a ; A 2 , . . . , A n ) < |Ai| ■ d\A 2 \ m ~\ □ 

Let K be an algebraically closed field of characteristic p > 0. A rational function 
f(x,y) G K.(x,y) is called composite if there exist rational functions Q(u),r(x,y) with 
Q(r(x,y)) = f{x,y) and degQ > 2. 

Lemma 15 (Bertini-Krull theorem, jSchOOj . p. 221). A polynomial f(x,y) € K.(x,y) of 
degree at most p — 1 is composite if and only if for a generic (cofinite) set ( £ I, the 
variety f(x,y) = t is reducible. 

We also use the explicit bound on the Fourier transform over a curve due to Bombieri: 

Lemma 16 (Theorem 6, |Bom66j ). Let P(x±,X2) be a polynomial of degree d over ¥ q 
without linear factors, and define the set S C to be the zero set of P. If £ is any 
non-zero additive character ofW'i, we have the following bound: 



< 2d 2 q^. 



4 Small sets 

We first prove that either A + A or f(A) + f{A) grows. The proof is inspired by Weyl's 
differencing method for estimating exponential sums. For deg/ = 2 differencing reduces 
the problem to the standard sum-product estimates, whereas for deg/ > 3, differencing 
lets us replace / by a polynomial of lower degree. 

Proof of Theorem^ The proof is by induction on k. We could use theorem [2] as our base 
case, but its proof is particularly simple when restricted to this special case, so we give it 
here. 
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Suppose f(x) = ax 2 + bx + c is a polynomial of degree two, which is to say a / 0. 
Assume that \ A + A\ + \f(A) + f(A)\ < A\A\. Then by the triangle inequality (Lemma |HJ) 
we have \f(A)-f(A)\ < \f(A) + f(A)\ 2 /\A\ < A 2 \A\ and \A-A\ < \A + A\ 2 /\A\ < A 2 \A\. 
As f(x) — f(y) = a(x — y)(x + y + b/a), it follows that there are at least \A\ 2 solutions to 

uw = z, with u£ A- A, w <E A + A + b/a, z G (1/a) • {f(A) - f{A)). 

By the Balog-Szemeredi-Gowers theorem (Lemma I12p applied to A — A and A + A + 6/a 
in the multiplicative group F* we infer that there are Ai C ^4 — ^4 and A 2 C A + A + b/a 
of sizes |Ai| > A 2 |A|/(16A 8 ) and \A 2 \ > A 2 |A|/(4A 4 ) such that \A X A 2 \ < 2 12 (A 4 ) 5 A 2 |A|, 
and hence 

\A 2 A 2 \ < 2 24 A 4 Vl < 2 26 A 46 |A2| 

by the triangle inequality. Since A 2 + A 2 GA + A + A + A + 2b/ a, Pliinnecke's inequality 
(Lemma El) implies 

\A 2 + A 2 \ < \A + A + A + A\ < A A \A\ < 4A 6 |A 2 |- 

These inequalities contradict (HD unless 2 26 A 46 > |A 2 | 1/12 > (| AI/4A 2 ) 1 / 12 . Simple arith- 
metic gives A » I ^4. 1 1/554 _ 

Hence, we know that whenever \A\ < yjp, and / is quadratic, we have \A + A\ + 
\f(A) + f(A)\ > C|A| 1+1 / 554 for some constant < C < 1. We shall prove that whenever 
deg/ = d > 3, we have \A + A\ + \f(A) + f(A)\ > C\A\ 1+ T^ with the same constant 
C. Suppose that deg/ = d > 3 and the claim has been proved for all polynomials of 
degree d — 1. Assume that \A + A\ + \ f(A) + f(A)\ < A\A\. Let t be any number having 
at least |A| 2 /|^4 — A\ > A _2 |A| representations as t = a± — a 2 with ai,a 2 G A. Define 
A' = {a £ A : a + 1 £ A} and g{x) = f(x + 1) — /(x). From the choice of i it follows that 
\A'\ > A~ 2 |A|. Pliinnecke's inequality (Lemma|9} tells us that 

\g(A') + g(A')\ < \f(A) + f(A) - f(A) - f(A)\ < \f(A) + f(A)\ A /\A\ 3 < A A \A\ < A 6 \A'\. 

We also have \ A' + A'\ < + < A\A\ < A 3 \A'\. However, g is a polynomial of degree 
d — 1, and by the induction hypothesis this implies A 6 > C\A\ 16V-1 . Since C < 1, we 
have C 1 / 6 > C, and the induction step is complete. □ 

We note that the argument in [Bou05j for x 2 + xy does not seem to generalize to 
an arbitrary quadratic polynomial, as that argument crucially depends on x 2 + xy being 
linear in y, and so our proof of Theorem [2] is again based on the idea of differencing. 

Proof of Theorem Assume that f(x, y) = ax 2 + by 2 + cxy + dx + ey is a non-degenerate 
quadratic polynomial, and \A + A\ + |/(A)| < A| A| . The Cauchy-Schwarz inequality 
implies that the equation 

ax\ + by\ + cxiyi + dxi + eyi = ax\ + by\ + cx 2 y 2 + dx 2 + ey 2 , x x , x 2 , y x , y 2 G A 
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has at least A 1 \A\ 3 solutions. Changing the variables to v\ = x\ + X2, vi = D\ + J/2, 
u\ = x\ — x 2) U2 = y\ — V2 we conclude that there are at least A - 1 1 3 solutions to 

au\V\ + bu2V2 + \c{u2V\ + U1V2) + dni + eu2 = 0, U\,U2 € A — A, v\ e A + A. 

Rewrite the equation as 

ui(avi + \cv2 + <i) + U2(bv2 + \cv\ + e) = 0, ui,u 2 £ A — A, v\,v 2 £ A + A. (3) 

Consider 3(^1,^2) = Q " 1+ 2 CT2+d _ Suppose g is a constant function. Then ab — jc 2 = 0, 

bv2+2 cv l+ e 

which implies that there are s,t G F p 2 such that ax 2 + 6y 2 + cxy = (sx + ty) 2 . Hence 
g(v\,V2) = f^+ff^+fi an d it is evident that dx + ey is a constant multiple of sx + 
ty, contradicting non-degeneracy of /. We conclude that g(vi,V2) cannot be a constant 
function. 

Without loss of generality we assume that g depends non-trivially on v\. Call v € 
bad if g(vi,v) is a constant function. As v is bad only when the linear functions in the 
numerator and denominator of g are proportional, there is at most one bad v, which we 
denote t;bad- When specialized to V2 = «bad equation ([3]) takes the form au\ + [3ii2 = 0, 
where a and (3 are constants, that are not simultaneously zero. Thus there are at most 
I A — A\\A 4- A\ solutions to (|3|) with V2 = ^bad- 

Choose a value for V2 for which there are at least N = A~ 1 \A\' i /\A + A\ — \A — A\ 
solutions to (|3|) and which is not bad. Since we can assume that A < \A\ 3 ^/2, it follows 
that N > iA -1 |A| 3 /|A+^4| > iA~ 2 |^| 2 . Let L u>Vl denote the line in F 2 with the equation 

(ui - u)(avi + \cv2 + d) + U2(bv2 + \cv\ + e) = 0. 

Since g is not a constant function of vi, the slope of L UtVl uniquely determines vi, from 
which we conclude that the family 

£ = {L u , Vl ■ n G A + A, u G A - A} 

contains \A + A\\A — A\ distinct lines. Consider the point set V = [{A — A) + (A — A)) x 
(A — A). Each solution to ([3]) yields \A — A\ incidences between V and C, one for each value 
of u. By Lemmas [8] and [9] we have \V\ < A 10 |^4| 2 and \C\ < A 3 |A| 2 . Szemeredi- Trotter 
theorem (LemmadU) implies that A _1 |A - A\N < (A 10 |^| 2 ) 3 / 2_e . Since \A - A\ > \A\, 
we are done. □ 

The sum-product estimate for \AA\ + \f(A) + f(A)\ is more delicate than the results 
above. Similarly to the proof of Theorem [Tj the aim is to use the upper bounds on 
f(A) + f(A) + • • • + f(A) to obtain an upper bound on g(A) + g{A) for some simpler 
polynomial g. However, now 'simpler' means 'having fewer non-zero terms', and one 
needs to add far more copies of f{A) to obtain a simpler g. If A was a product set 
itself, A = BC, then our aim would be to find 61,62,- •• G B so that the polynomial 
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h(x) = f(b\x) + ffax) + • • • has fewer terms than / has. If \B\ > p £ we can hope to use 
the pigeonhole principle to find h(x) and h'(x), in which one of the terms is the same. 
Then the polynomial h(x) — h'(x) would have fewer terms. Unfortunately, it might happen 
such h(x) and h'(x) are always equal, in which case h(x) — h'(x) = 0. However, as BB is 
small, for each fixed A we can find a large set B' C B and an element g such that gb x £ B 
for all b £ B' . Using B' in place of B then permits us to use not only the terms of the 
form f(bx), but also of the form f(gb x x). As it turns out, that suffices to complete the 
proof. Regrettably, A is not necessarily a product set, and that requires us to work with 
multiplications along a graph, introducing additional technical complications. 

The following lemma is used to find an analogue of B' in the sketch above. Recall that 
A • A = {Xa : a £ A}. 

Lemma 17. Suppose Ai,...,A r are non-zero integers, and T is an abelian group. Fur- 
thermore, assume that A C T satisfies \ A + A\ < K\A\. Let 

r 

P = 43^1og 2 (l + |Ai|). 

i=l 

Then there is a set B C A of size \B\ > ^K~ P \A\ and elements g\,. . . ,g r such that for 
every b £ B the set 

{a £ A : a + Xi ■ b + gi £ A for all i = 1, . . . ,r} (4) 
has at least ^K~ P \A\ elements. 
Proof. For given g\ , . . . , g r £ F define 

S(gi, ■ ■ ■ ,g r ) = ^#{ a £ A : a + Xi ■ b + gi £ A ior all i = 1, ... ,r}. 

Summing over all (g±, . . . ,g r ) £ T r we obtain 

^2 S(gi,. . .,g r ) = ^2 #{(gi,- ■ -,g r ) £ T r : a + A^ • b + gi £ A for alH = 1, . . . ,r} 

= ^ \A\ r = \A\ r+2 . 

a,b£A 

Since S(gi, . . . ,g r ) = unless gi £ A — A — Xi ■ A, there is a way to choose gi, . . . ,g r so 
that 

r 

S{g 1 ,...,g r )>\A\ r+2 J[\A-A-X l -A\~ 1 

i=l 

> lA^K"^ 1 ( 7 + 12 + 12 + 12 iog 2 (i+|Ai|)) 

> K~ P \A\ 2 
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by Lemma \W\ and the inequality 31 + 121og 2 (l + A) < 431og 2 (l + A) valid for A > 1. 
Having chosen gi,. . . ,g r , define B to be the set of all b G A for which the set in Q has 
at least ^-fC _p |^4| elements. Since the elements b G A \ B contribute at most ^-ftT -P |^4| 2 
to S(gi, . . . ,g r ), the lemma follows. □ 

Let pt(xi, . . . , x n ) = x\ + • • • + x l n be the t'th power sum polynomial. 

Lemma 18. Suppose IK is a field, and < t\ < ti < ■ ■ ■ < t r < char IK are integers. 
Let w: Aj£ —> Aj^ be given by w{x\, . . . ,x r )i = Pti(xi, ■ ■ ■ ,x r ). Furthermore, assume that 
there are sets Si,...,S r Cl of size n each, and S = Si x • • • x S r is their product. Then 
there is a set S' C S of size \S'\ > |5| — n r_1 Yli^i such that for all x E S' the number of 
solutions to 

w(x) = w(y) with y G S' 

is at most Y\ i t{. 

Proof. The Jacobian determinant of w is 

J(x) = dei(dp t Jdxj)ij = det(tjx^~ 

The polynomial J is of degree — an d it is non-zero since its degree in xi is 

t r — 1 < char K. By Lemma J vanishes in at most n r_1 ^2^(U — 1) points of S. Thus 
the set S' = {x G 5 : J{x) ^ 0} is of size \S'\ > \S\ - n r ~ l ^U- 

For each x £ S' the variety V x = w~ 1 (w(x)) is zero-dimensional. The bound on the 
number of points of V x then follows from Bezout's theorem (Lemma ll3p . □ 

Proof of Theorem^ The proof is by induction on k. Suppose k = 1 and / = ax d . Let 
B = {x d : x G A}. Then \B\ > \A\/d. The condition \AA\ + \f(A) + f(A)\ < A\A\ 
implies \BB\ + \B + B\ < dA\B\. Then jT} implies dA > (l^l/d) 1 / 12 , which together with 
| ^4 1 > d 20 establishes the base case. 

Suppose / = Y^i=i o-iX di and k > 2. If g = gcd(di, . . . , dk) / 1, then upon replacing A 
by {a 9 : a G A} and / by ^ aiX di ^ 9 the problem reduces to the case gcd(di, . . . ,dk) = 1. 
Assume + + < A\A\. For 6 G T define A 6 = {a G A : ba G A}. Since 

gcd(di, . . . , dfc) = 1, there is a pair of exponents di,dj such that d{ and ct,- are not a power 
of the same integer. Without loss of generality we may assume that these two exponents 
are d\ and c?2- Let 

\ i = d r 1 ' i 4 for i = l,...,r. (5) 

Then by the choice of d\ and cfe, all the A's are distinct. Lemma [17] then yields a set 
B C A of size |JB| > ^A~ P |A| and elements gi,...,g r such that if we define 

A b = {a G A : gib Xi a G A for all z = 1, . . . , r}, 

then for every b G B 

\A b \ > ±A~ P \A\, 
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where P < 43r log 2 (l + d r ) < 100r 2 log 2 d. Thus, 

\A bl n---nA br \ = ^2(#{b€A:aeA b }) r 

bi,...,b r eA a&A 

(EaeA(#{beA:aeA b }Y 



> 



\A\r- 1 

> (iA- p ) 2r i^r +i . 

Define a graph G as follows. Its vertex set is 

V(G) = {(&!,. . - A) G 5 r : | A, n • • • n A br \ > 2' 2r ~ 1 A~ 2Pr \A\}, 

and the pair (h,...,b r ), (b r+1 , . . . , b 2r ) is an edge of G if | A bi n- ■ -nA b2r \ > 2~ 4r - 3 A- 4Pr \A\. 
Since |V(G)||A| + \B\ r (2~ 2r ^ A~ 2Pr \ A\) > (|A- p ) 2r |A| r+1 , the number of vertices in G 
is |V(G)| > 2~ 2r ~ 1 Ar 2Pr \A\ r . Moreover, the graph G contains no independent set of size 
2 2r+2 A 2Pr . Indeed, if . . . , &i,r)}^=i C ^(G) is an independent set of size m, then 



\A\ > 



1=1 



> m2- 2r - 1 A- 2Pr \A\ - ( ™ )2- 4r - 3 A- 4Pr |A|, 



implying m < 2 2r+2 A 2Pr . 

For each b 6 FJj define b A = (6 A ,...,6 A ). Let = (b dl , . . . , b dk ), and for each 
b = (61,..., b r ) £ V(G) define u(b) = + • • • + u(b r ) = ( Pdl (b),p d2 (b), . . . ,p dfc (b)). 

Now the argument breaks into two cases. In the first case, we will reduce the problem to 
the case of a polynomial with at most k — 1 terms, whereas in the second case, we will 
show that a certain sumset associated to V(G) is too large. 

Case 1: Suppose there is an index i £ [r] and bb' £ -E(G) such that «(b Ai ) 7^ u(b /Ai ), 
but u(b Ai ) and u(h' Xi ) share a coordinate, that is p^(b) = p^(b') for some j. Then 

5^) = f( gi b^x) + ■■■ + f( gi b^x) - f(gib'^x) + ■■■ + f( 9i b' r Xi x) 

k 

is a non-constant polynomial with one fewer term than /. Moreover, if we define A' = 
4n-n\n^n-n A K then g{A') + 9 {A') C (2r) * /(A) - (2r) * f{A). By the 
definition of the graph G, 

, ,/, \A\ 
A \ > 1 1 

I I — 2 2r +' i A 2Pr ' 

A simple, but tedious calculation shows that if A is small enough for the conclusion of the 
theorem to fail, then \A\ > d^ T p A / r implies \A'\ > d 40 ( r + 1 ) p 4 / '( r + 1 ) ) permitting us to apply 
the induction hypothesis with r and k replaced by r + 1 and k — 1 respectively. Then, 
since 

<A 4 ^| = (A 4 ^)L4'|, 
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the induction hypothesis implies that A 4r |^y > \A'\ £ , where e = (5000(r + k) 2 log 2 d) k ~ l . 
Another tedious calculation shows that this implies the desired lower bound on A. 

Case 2: Suppose that for all i E [r] and all edges bb' 6 E(G) either all coordinates of 
ii(b A ') and u(h' Xi ) are equal, or all of them are distinct. In particular, p^ 1 (b Ai ) = Pd 1 (b /A ') 
and pd 2 (h Xi ) =p^ 2 (b /Al ) happen simultaneously. Let 

W(b) = ( Pd r d2 (b),p d r-l d 2(h), . . . , Pd r + l(b)). 

Since p ( i 1 (b Ai ) = p^— i+ ij (b) and pd 2 {b x ') = p d r-i d i+i(h), it follows that if bb' G E(G), 
then either w(h) = w(h') or w(h) differs from w(h') in all coordinates. 

By Lemma M there is a set X C V(G) of size at least \V(G)\ - \A\ r - 1 rcT +1 > ^\V(G)\ 
such that \X n ^ -1 (^( b ))l < rfr2+r for a11 bel. Since \A\ > (i 40r , and 2 r > r either A 
is sufficiently large to stop the argument here, or we have 



1*1 > \V(G)\ - \A\ r ~ 1 rd r+1 > \A\ r - 1 {2- 2r - l A~ 2Pr \A\-rd r+l ) > \A\ r 2~ 2r - 2 &- 



2Pr 



By the pigeonhole principle there is an h € F p such that the hyperplane H = {xi = h} C 
Fp contains w(h) for at least |*|/p values of b £ X. Pick X' C X such that the points 
{w(h)}b £ x' all lie in and are distinct, and such that 

, „ 1 IXI 2- 2r ~ 2 A- 2Pr \A\ r 
X' > ■ — > 



d r2+r p pd r2+r 

Since such an X' is an independent set in G, we conclude that \X'\ < 2 2r+2 A 2Pr , and the 
theorem follows. □ 



5 Large Sets 

The results in this section require a more systematic use of the idea of differencing appear- 
ing in the proof of Theorem [TJ and used throughout the previous section. The differencing 
is a special instance of a general strategy that consists in repeatedly applying the Cauchy- 
Schwarz inequality to increase the number of variables involved. Geometrically a single ap- 
plication of the Cauchy-Schwarz corresponds to taking fiber products of two varieties. For 
illustration, consider the problem of showing that f(A) grows. Let g(x, y, z) = f(x, y) — z. 
It suffices to show that N(g;A, A,C) is much smaller than \A\ 2 for every set C C ¥ q 
of size |C| = \A\. This is a problem of bounding the number of points of a particular 
variety V = {g(x, y, z) = 0} on the Cartesian product A x B x C. The Cauchy-Schwarz 
inequality tells us that the number of points of V = {g(x,yi, z±) = 0, g(x, 2/2, ^2) = 0} on 
i4x5xBxCxCisat least N(g; A, B, C) 2 /\A\, and thus it suffices to establish a non- 
trivial upper bound for the number of points on V' . The variety V' is the fiber product 
of V with itself for the projection map on the first coordinate. However, in general one 
can use less trivial fiber products. 
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The problem about sets, whether f(A) is large, reduces to the problem about count- 
ing points on the variety {f(x,y) — z = 0} on Cartesian products, but to pass in the 
other direction the only tool currently available is the Balog-Szemeredi-Gowers theorem 
(Lemma ll2j) . which applies only for linear /. It is because of this we are forced to work with 
varieties rather than sets. We note however that the analogues of the sumset inequalities 
that were extensively used in the previous section are easy to prove with Cauchy-Schwarz 
as above. For instance, let g(x, y,z) = x + y — z, and A = B = C , then the lower bound 
for the number of points on V = {x + y — z = 0} implies a lower bound for the number of 
points on V = {x + y\ — z\ = 0, x + y2 — zi = 0}, which upon elimination of x variable, 
yields a lower bound for the number of points on V" = {y\ + zi — y<i — z\ = 0}. This 
relation corresponds to the Pliinnecke inequality for A + A — A. For general analogues of 
Ruzsa's triangle inequalities and Pliinnecke's inequality see |Raz07| . Another interesting 
example of a systematic use of fiber products in additive combinatorics is in [KT 99j . 

To establish the estimates on N(f; A, A, B) promised in theorem [4] we will use the 
special case of N(f; A, A, A) as a stepping-stone. The general result will be deduced via 
an application of Cauchy-Schwarz as explained above. 

Lemma 19. Suppose n > 2 and f(x) is an n-variable polynomial of degree d with no 
linear factors, and A C ¥ q . Then if \A\ > q 1 ^ 2 and d < q l l^ n , we have the estimate 

, A A , \A\ n [\A\(\A\/^qf/H-\ ,/Q 1 /2<| A |< d 4/5 {? 7/10 ) 

1 1 N(f-A) n \|A|(g/|A|) 1 /3 d -i/3 ) if \ A \ >d4/ 5(? 7/io_ 

Proof. We will first establish the case n = 2, and then reduce the general case to it. 

Case n = 2: The proof is a standard Fourier-analytic argument. Namely, let B = 
A + A, and S C be the set of solutions to f(x) = 0, and let C = S n A 2 be the subset 
of the solutions with coordinates in A. Since C + {A x A) C B x B it follows that 

. |C|-|A| 2 

\XBxB * X-Ax-A, XS)¥l > • 

Using Plancherel's theorem, and evaluating the term £ = separately, we obtain 
\C\-\A\ 2 _ „ „ . \B\ 2 \A\ 2 ^ „ , , rt . , 

7 < [XBxB ■ X-Ax-A,XS)f2 = e h y^XBxB(€)X-Ax-A{ti)Xs{-ti)- 

Lemma [14] gives the bound \ S\ < dq. Moreover, since / has no linear factors, by Lemma [TBI 
Xs(€) *C d 2 q~ 3 / 2 . Therefore, by Cauchy-Schwarz and Parseval 

\C\-\A\ 2 ,\B\ 2 \A\ 2 ]2 „ 3/2 ..„ 

i <d 111 1 +d 2 q 6/2 ( XBxB , X-Ax-A }f2 

<dM^+dV 3/2 ||XBx B || 1/2 ||X-Ax-^|| 1/2 
q° 

\B\ 2 \A\ 2 ,o\B\\A\ 
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Since |C| = N(f; A, A), we deduce that either J^l^l 2 > h<l\ A \ 2 or \ B \m¥a) ^ ~ 



\A\ 2 

In both cases, at least one of \B\ = \A + A\ and J/ f . A \ is as large as claimed, and the 



N(f;A) I ""I - 2(fyi yi l U1 l-"ljV(/;A) - cPq 1 / 2 ' 



result follows. 

Case n > 2: Having established the n = 2 case, we proceed by induction on n. 
Note that we can suppose / is irreducible. Now by Lemma [32| one can pick a variable, 
say x\, such that for all but d n (n — 1) 'bad' elements c G ¥ q of x±, the polynomial 
fc = f(c, X2,x%, . . . , x n ) has no linear factors. Then 

N{f-A)<d n {n-l)\A\ n - 2 + N(f c ;A). 

'good' c 

Now, if the first term on the RHS is bigger, we are done. Else, we have: 
\A + A\ + -^—>\A + A\+ 



N(f,A)~ l N(f c ,AY 

and we are done by induction. □ 

Lemma 20. Let W be an m- dimensional irreducible variety of degree d > 2 in Ap , and 
A C F q . Then if \A\ > q 1 / 2 and d < q l /^ n , we have the estimate 

\A A\ |Ar+1 imA\/VQ) 1/2 d-\ i fq 1 / 2 <\A\<d*/W /1 °, 

1 1 N{W-A) n \\A\(q/\A\) 1 / 3 d- 1 , if \A\ > d^q 7 / w . 

Proof. By Lemma [33] we can find a subset of m + 1 coordinates yi,y.2, ..,y m +i on which 
to project W such that the image lies in an m-dimensional variety W of degree d! > 2, 
which is thus defined by an equation f(y) = for a polynomial / of degree d' . By 
Corollary [31] and Lemma [13] there is a proper subvariety V C W of degree at most 
d 3 n on the complement of which the projection from W to W is at most (d/d')-to-l. 
Hence, JV(W;A) < N(V;A) + (d/d')N(W; A). As dimV < m - 1, Lemma [H yields 
iV(V; A) < d 3 n|A| m " 2 . Thus if (d/tf)iV(W; ^) < N(V;A), then we are done. Otherwise, 

\A + A\+ ',' >\A + A\ + 



N{W;A)-' 1 2(d/d / )^(W // ;-4) 
and the results follows from the previous lemma. □ 

For the case where A is an interval, similar results to Lemma [20] were obtained by 
Fujiwara |Fuj88| and Schmidt [Sch86] . 

The preceding lemma implies Theorem U] concerning the lower bound on \A + A\ + 

|B||A| 4 
N(f;A,A,B) ■ 

Proof of Theorem [^J Let V C A| be the 3-dimensional variety 

f(xi,x 2 ,x 5 ) = f(x 3 ,x 4 ,x 5 ) = 0. 
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Note that V is the fiber product of f(x, y, z) = with itself along the projection to the last 

A,A 
JSl 



coordinate. By the Cauchy-Schwarz inequality, N(V; A, A, A, A, B) > N (f>^A,B) • Let 



U C A 4 ^ be the projection of V onto {X5 = 0}, and W be the Zariski closure of U. Note 
that for any point u € U, one of two things can happen: the preimage of u in V consists 
of at most d points, or the preimage in V is all of . But the latter can only happen 
for at most d 2 points, by Corollary [29) Thus we arrive at the following upper bound on 
N(f;A,A,B) 

N (f'A,A,B) 2 < N (y. Aj Aj Aj Aj 5) < dN{ jj. A } + d 2 q 
\B\ 

Note that if dN(U; A) < d 2 q, then the lemma follows. Thus we may assume 

N(f;A,A,B) 2 



\B\ 



< 2N(U;A) < 2N(W;A). 



An upper bound on N(W; A) will now follow from the preceding lemma. Since the 
degVF < d 2 , the condition d < g 1 / 5n of the lemma holds, and to apply the lemma it 
suffices to check that W does not contain a hyperplane. 

We argue by contradiction. Suppose a hyperplane L = {01X1+02X2+03X3+04X4+05 = 
0} is a component of W. Since the surfaces 

S c = {f(xi,x 2 ,c) = /(x 3 , x 4 , c) = 0} 

fiber W, the varieties S C D L fiber L. So by the dimension count, S c n L is generically 
2-dimensional. Thus for a generic c £ A , S c has a component in L. This can happen 
only if a\x\ + 02X2 is constant on a component of /(xi,X2,c) = 0, which means that 
/(xi,X2,c) = contains a line with slope s = = ^~. But having slope s is a Zariski-closed 
condition, and since V is irreducible, all points in V must have slope s, which contradicts 
the assumption that /(x, y, z) is not of the form P(a\x + a 2 y, z). 
Therefore we can apply the previous lemma to get the bound 

|i?|L4| 4 [\A\{\A\/^qfl 2 d-\ if gV2 < \A\ < d^q 7 / 10 , 

1 1 N(f;A,A,B) 2 [lAKq/lAD^d- 1 , if |^| > d 4 / 5 g 7/10 - 

□ 

We conclude by presenting the deduction of Theorem[5]on growth of \ f{A) + f(A)\ + 
\g(A, A)\ from Theorem HI 

Proof of Theorem^ Since /(x) has at most d poles, we may assume that A contains none 
of them. 

The idea in the proof is to create a polynomial having many solutions on f(A) and 
apply the previous lemma. This motivates us to define fields K = ¥ q (x,y) and = 
F 5 (/(s), f(y))- It is easy to see that K f C K and [K : K f ] = deg(/) 2 . 
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Let H (x, y, t) be the minimal polynomial for g(x, y) over Kf . Note that H (x, y, g(x, y)) = 
0. Write H(x,y,t) = S(f(x),f(y),t). Then S(x,y,t) is an irreducible polynomial with at 
least \ f(A)\ 2 roots on f(A) x f(A) x g(A,A). If S(x,y,t) is not of the form P{ax + by,t), 
Theorem |4] yields 

i ha) + fr^i 4 l^- 4 )!!^)! 4 ^ / I^KMA/*) 1 ^- 1 , if i 1/2 < \a\ < D^qV 10 , 

lH ' J{ )l (\f(A)\ 2 ) 2 \\A\{q/\A\)^D-\ if L4| > Z^V 710 , 

where D = deg S. Since an indeterminate 3- variable polynomial S' of degree D' has ( D 
coefficients, and the condition that the rational function S'(f(x),f(y),g(x,y)) vanishes 
is a system of ( 4rf ^ +2 ) linear equations, there is a non-zero polynomial S' satisfying 
S'(f(x),f(y),g(x,y)) = 0, of degree at most D\ provided 

Since S is irreducible, S divides «S" implying D < D' < 12d 2 . 

If S(x,y,t) = P(ax + by,t), then it would force P(af(x) + bf(y),g(x,y)) = 0, implying 
that g(x,y) is algebraic over ¥ q (af(x) + bf(y)). If that is so, g(x,y) is of the form 
G{af{x) + bf(y) + c), G{x), or G{y) by LemmaEl □ 



6 Growth of f(A, B) for very large sets A 1 B 

In this section we prove Theorem [6] on growth of f(A,B) without any assumptions on 
A + A. Excluding several algebraic lemmas, whose role is auxiliary to the main flow of 
the argument, the proof is not long. However, the shape of the proof might be mysterious 
without further explanations. 

In the previous section we saw a way to use the smallness of A + A to reduce the task 
of bounding N(f;A) to estimating Fourier transform of the curve {/ = c}. The latter 
was achieved by invoking the celebrated Weil's bound. In the absence of any assumption 
on A + A, we need to dispose of the Fourier transform. The motivation for our approach 
comes from Gowers U 2 norm, which is a substitute for the Fourier transform in additive 
combinatorics. If instead of smallness of A+A we assume smallness of another polynomial 
g(A, A), then treating g as 'addition' and its 'inverse' (which might exist only implicitly) 
as 'subtraction', we can create an analogue of the U 2 norm of g. Thus we require a 
polynomial g for which g(A) is small. It turns out that often the polynomial / itself can 
fulfill the role of g. Moreover, assuming smallness of f(A) we can create many varieties V 
for which N(V) is small using fiber products as in the previous section. 

The substitute for the Fourier transform is not enough if there is no analogue for Weil's 
bound. That final ingredient comes from the bound on the number of points on irreducible 
varieties. It should be noted that we will not use the full strength of Deligne's bound on 
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the number of such points, but rather an earlier theorem of Lang and Weil |LW54j that is 
a consequence of Weil's work on curves. 

An alternative perspective on the argument that follows is that it is about turning 
sums over large subsets A c¥ q into complete sums over ¥ q , which are easier to study by 
algebraic means. A desirable estimate on such a sum is |X)a;eA &( x )\ ^ J2 x eF q ^( x ) > but 
in general it holds only if S(x) is positive. The terms S(x) that appear in our sums are 
not always positive, but the Cauchy-Schwarz inequality reduces the problem to a sum of 
positive terms, which can be completed: 

i 

2 





< \A\* 




i 

2 

<\A\i 


E s ^ 2 













The sum on the right is now a complete sum. 

Proof of Theorem® Set C = f(A,B). Suppose |C| < q/2, for else there is nothing to 
prove. We start with the inequality 

l^4| 2 |£| 2 < E A(x 2 )B(y 2 )C(z 1 )C(z 2 )C(f(x 1 ,y 1 )), 

where we identify sets with their characteristic functions, that is A(t) = 1 if t G A, and 
otherwise, and likewise for B and C. 

As in the usual application of Gowers U 2 norm, we will need to replace one of the sets 
by a function of mean zero. Let S(t) = 1 for t G C, and S(t) = — q _\Q\ if t $ C. The 
function S(t) is clearly of mean 0. Since / is monic in y, for each x and z there are at 
most d solutions to f(x,y) = z. Thus there are at most c£|.A||C| solutions to f(x,y) = z 
with x 6 A, y 6 ¥ q and z £ C, and we have 



\A?\B\ 



& 



\A\\B\\C{ 
q-\C\ 



< 



E 

/(zi,2/2)=2l,/(z2,2/l)=Z2 



A(x 2 )B(y 2 )C(z 1 )C(z 2 )S(f(x u y 1 )) 



If |C| 3 > |A||i?|gr/4(i 2 , we are done. Otherwise, the first term dominates and we obtain 
|A| 2 |i?| 2 <2 E A(x 2 )B(y 2 )C(z 1 )C(z 2 )S(f(x 1 ,y 1 )) 

f(x 1 ,y 2 )=z 1 ,f(x 2 ,yi)=Z2 

We proceed to 'clone' variables by applying Cauchy-Schwarz with respect to y 2 ,x 2 to 
obtain 



l^l 4 ^! 4 < 4 E {A\x 2 )B 2 (y 2 )) E E C(z 1 )C(z 2 )S(f(x 1 ,y 1 )) 

V2,x 2 \f(xi,y2)=z 1 ,f(x2,yi)=Z2 



V2,X2 
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where the inner sum on the right side ranges over z\, z 2 , x\, y\. Expanding, we get 
\A\ 3 \B\ 3 <4 C(z 1 )C(z 2 )C(z 3 )C(z i )S(f(x 1 ,y 1 ))S(f(x 3 ,y 3 )). 

f(xi,y2)=zi,f(x2,yi)=Z2 

f(x3,y2)=Z3,f(x2,V3)=Z4 

We come to our final application of Cauchy-Schwarz to this sum, this time with respect 
to z 1 ,z 2 ,z 3 ,z i : 

6 < 16 Yl 5(/(x 1 ,y 1 ))5(/(x 3 ,y3))5(/K, y ;))S(/(x / 3,^)). 



V 



f(xi,y2)=z 1 J(x2,yi)=z 2 

f(x3,y2)=Z3,f{x2,y3)=Z4 

f( x 'i ,y'2)= z i >f( x 2 ,v'i)= z 2 
f{x3>y'2)= z 'iJ( x 2>y'3)= z i 

Or more succinctly, 

|4|6|D|6 

< 16 E 5(/(x 1 , yi ))5(/(x 3 , ? /3))5(/(x' 1 , y i))5(/(x3, y 3)). 

.f( x i,y2)=f(x' 1 ,y' 2 ),f(x 2 ,yi)=f(x' 2 ,y[) 
f( x 3,y2)=f(x' :i ,y 2 )J(x2,y3)=f( x 2,y'3) 

We next split the sum into many. Geometrically, the summation is over the variety 

f(xi,y 2 ) = f{x' 1 ,y' 2 )J(x 2 ,y 1 ) = f(x 2 ,y'i)) 

f(x 3 ,y 2 ) = f(x' 3 ,y' 2 ),f(x 2 ,y 3 ) = f(x' 2 ,y' 3 ) J 

Define 0: V -> A 4 by </>(. . . ) = (f(x 1 ,yi)J(x 3 ,y 3 ),f(x / 1 ,y' 1 )J(x' 3 ,y' 3 )), and let V tl ,t 2 ,t3M c 
V be the variety 4>~ l {t\, t 2 , t 3 , i 4 ). Then we can rewrite the preceding inequality as 

|4|6|R|6 _ 

< 16 £ 5(t 1 )5(t 2 ) < 5(t 3 )5(t 4 )iV(y tl , t2 , t 3^;F ? ). 

Heuristically, one expects such a complicated variety as Vt lt t 2 ,t 3 ,u to be usually irre- 
ducible and of dimension 4, since it is given by 8 equations in 12 variables. If that was 
indeed the case, then by |LW54j N(V tl ,t 2 ,t3,Ui ^q) = 9 4 + Od{q 7 ^ 2 )- Since S is of mean 
zero, that would give the estimate of the theorem. 

Formally, we appeal to Lemmas [23] and [25] that tell us that V is 8-dimensional, irre- 
ducible and there is a Zariski-dense open set U C A 4 such that whenever (ii,i 2 ,i 3 , i 4 ) € U, 
the variety Vt 1: t 2 ,t3,t 4 is 4-dimensional and irreducible. Let Y = A 4 \U. Since dimcf) (Y) < 
7, the variety <j)~ l (Y) contains at most Od{q 7 ) points in F 42 and thus 

« £ 5(t 1 )5(t 2 )5(t 3 )-S(t 4 )iV(y tl , t2 , t 3, t4 ;F ? ) 

E 5(t 1 )5(t 2 )5(t 3 )5(t 4 )(g 4 + O d (q 7 ' 2 )) + O d (g 7 ) 
ti,t 2 ,t3,*4eC/nF g 

« d c/ 7 / 2 E |5(ti)5(t 2 )5(i 3 )5(i 3 )| + O d (g 7 ) « d iV 4 ^/ 2 , 
ti,*2,*3,*4ec/n-F 8 

where the most important step is using that S(t) is of mean zero to pass from the second 
to the third line. Thus |A ffl & < d N 4 q 7 / 2 , it follows that N > d \A\ 3 / 4 \B\ 3 / 4 q- 7 / 16 . □ 
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7 Proofs of Lemmas [23] and [24 



Before starting, we introduce a piece of terminology. 

Definition. An ra-dimensional variety V over an algebraically closed field K is said to be 
mainly irreducible if it has a unique irreducible component of dimension n. 

The following lemma will be very useful in showing that many fiber products are 
irreducible: 

Lemma 21. Let V, W be n- dimensional irreducible varieties over IK with dominant, finite 
maps f : V —> AJ^ and g: W —> Aj| of degrees prime to p = char K, and let V', W C AS be 
the subsets over which f, resp. g are unramified. Then if V' U W = A^, the fiber product 

V x&n W is irreducible. 

K 

Proof. Let F = K(V) and G = K(W) be the function fields of V and W considered as 
finite extensions of M = K(Ajg). Now V x_&g W is irreducible if and only if F and G 
are linearly disjoint over M. Since V U W = AS, the fields K and G have coprime 
discriminants over M. Let F', G' be the Galois closures of F and G over M respectively. 
Then F' and G' still have coprime discriminants, and so their intersection ¥' n G' is an 
unramified extension over M with degree prime to p, which must be M itself, since AS has 
no nontrivial unramified extensions of degree prime to p. But since F', G' are Galois, this 
implies that they are linearly disjoint, and hence the subfields F and G are also linearly 
disjoint, as desired. □ 

Note that by looking at the map between generic points the above proof carries through 
over the part of AJ| where / and g are finite. We thus have the following easy corollary: 

Corollary 22. Let V, W be n- dimensional, mainly irreducible varieties over K with dom- 
inant maps /: V — > A^ and g: W — >■ A^ of degree prime to p, and let V' , W C A^ be 
the subsets over which f , resp. g are unramified. Then if V U W = AS-, the fiber product 

V x^n W is mainly irreducible of dimension n with the possible exception of components 
that do not map dominantly to A^. 

Prom now on we adopt the notation of proof of Theorem [6l so let V be the variety 
defined by the equations 

{f{xi,V2) = f(x' 1 ,y' 2 ),f(x 2 ,yi) = f(x' 2 ,y[), f(x 3 ,y 2 ) = f(x' 3 ,y' 2 ),f(x 2 ,y3) = f(x' 2 ,y' 3 )} . 

Define also fi(x, y) and f 2 (x, y) to be the derivatives of / with respect to the first and 
second coordinates respectively. Let deg 1 f and deg 2 / be degrees of / in the x and y 
variable, respectively. 

Lemma 23. Under the assumptions of Theorem V is an 8-dimensional mainly irre- 
ducible variety. 
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Proof. Denote by W the variety 



{f(x 1 ,y 2 ) = f(x' 1 ,y' 2 ),f(x 3 ,y 2 ) = f(x' 3 ,y 2 )} , 

and by W the variety 

{f(x 2 ,yi) = f(x' 2 ,y' 1 )J(x 2 ,y 3 ) = f(x' 2 ,y 3 )} . 

Since V = W x W 1 , it suffices to show that W and W are mainly irreducible of dimension 
4. We focus on W, the case of W' being symmetric: 

Consider the map 4>: A 3 — > A 2 given by 4>(r,s,t) = (f(r,t),f(s,t)). Notice that W 
is the fiber product A 3 x A 2 A 3 of A 3 with itself with respect to the map <fi, so to show 
that the W is mainly irreducible it suffices by Lemma [35] to show that the fibers of (j> are 
1-dimensional and generically irreducible. 

The fibers of 4> are <f>~ 1 (a, b) = {f(r,t) = a,f(s,t) = b}. If we fix t, then by our 
assumptions on / we get a finite, non-zero number of solutions in r and s, so the fibers 
are one-dimensional. To see that they are generically irreducible, denote by z a the curve 
/(r, t) = a and let 7r a : z a — > A 1 be the projection map onto the second coordinate, that is 
onto t. 

Notice that q~ 1 (a, b) = z a x A i Zb with respect to the maps 7r a , 7T{>. Since / is not a com- 
posite polynomial, z a is generically irreducible by the Bertini-Krull theorem (Lemma I15|> . 
We are now in a position to apply Corollary [22] By our assumption on /, 7r a is a finite 
map for all a. By the Jacobian criterion, the ramification locus of 7r a on the base is the 
set of t for which there exists an r with /i(r, t) = 0, and /(r, t) = a. 

By our assumptions on /, for any fixed r there are only finitely many t such that 
/i(r, t) = and so finitely many a such that r is in the bad locus of 7r a . This implies that 
for generic a, b, the maps n a and tt\, have disjoint bad loci and so we can apply Corollary 1221 
This proves that W is 4-dimensional and mainly irreducible. □ 

Recall that V^^.t^M is the variety 

' f(xi,y 2 ) = f{x' 1 ,y' 2 ),f{x 2 ,yi) = f(x' 2 ,y[) 
f(x3, V2) = f(x 3 , y' 2 ), f(x 2 ,y 3 ) = f{x 2 , y 3 ) 
f(xi,yi) = h,f(x 3 ,y 3 ) = t 2 
f(x'i,yi) = t 3 ,f(x' 3 ,y 3 ) = t 4 

Lemma 24. The 4-dimensional family of varieties Vj li j 2i j 3> t 4 is generically 4-dimensional 
and mainly irreducible. 

Proof. Define W to be the variety {f(x 1 ,y 2 ) = fix'^y'^), f(x 2 ,y 1 ) = f(x 2 , y[)} and W tl ,t s 
to be the variety 

{f{xi,y 2 ) = f{x' 1 ,y' 2 ),f(x 2 ,y 1 ) = f{x' 2 ,y' x ),f{xi,y 1 ) = f{x' x ,y' 1 ) = t 3 ) . 



> . 
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We shall use the fact that Wt lt t 3 has a canonical map tt^ t 3 '■ W*i,t 3 ~~ * ^ 4 given by 
projecting onto the coordinates x 2 ,x' 2 ,y 2 ,y' 2 , and Vt!,t 2 ,t 3 ,t4 — Wti,t 3 x a 4 W*2,i4 with respect 
to these maps. 

Lemma 25. T/ie varieties t 3 ^re 4- dimensional and generically mainly irreducible. 

Proof. First, since / is non-composite, the variety {f(x±, y±) = t\, f(x' 1 ,y[) = £3} is gener- 
ically irreducible in x±, yi, x'^ y[ by the Bertini-Krull theorem. Now if we can show that 
for generic x 1 ,y 2 ,x' 1 ,y' 2 , the varieties {f(x 1 ,y 2 ) = f(x' x ,y 2 )} and {f(x 2 ,yi) = f{x 2 ,y[)} 
are 1-dimensional and generically irreducible, then we will be done by an application of 
Corollary [221 with respect to the projection map onto x%, y%, x' x , y[. 

We will handle the variety A xljX r := {f(xi, y 2 ) = f(x'i,y' 2 )}, the other one being sym- 
metric. For each xi,x[,y 2 we have a non-empty finite set of solutions for y' 2 , so A xuX > is 
clearly 1-dimensional. 

Define a 3-dimensional variety A by A = {f{x\,y 2 ) = f(x' 1 ,y' 2 )}. Since / is not 
composite, A is mainly irreducible. Consider the map (ft: A — > A 2 given by projection 
onto the xi,x[ coordinates. The fibers are precisely the A xi y . So by Lemma l36l A xi y^ 
is generically irreducible if and only if the variety A xA 2A, defined by 

f{x\,y 2 ) = /(xi,?^), f{x\M) = f{x' 1} b' 2 ) 

is mainly irreducible. But this is our W' from Lemma l23| which we have already shown 
to be mainly irreducible. This completes the proof. □ 

Lemma 26. T^ti,t 3 is generically dominant. 

Proof. This is equivalent to proving that the Jacobian of tt^ t 3 does not vanish identically 
on Wt x ,f 3 • The Jacobian is readily computed to be 

J = h{xi,yi)h{x2,yi)h(x' 1 ,y' 2 )f 2 (x' 1 ,y' 1 ) - fi(x 1 ,y 2 )f 2 (x 1 ,y 1 )f 1 (x , 1 ,y , 1 )f 2 (x' 2 ,y' 1 ). 

We have to show that J does not vanish on W. Define g(s,t) = tt 2 ^ ■ Notice that none 
of the fi or f 2 terms are identically on W. Assume J vanishes on W for the sake of 
contradiction. Consider J as a polynomial in y 2 , y 2 . Then the assumption that J = 
on W implies that on A Xl y := {f(x±,y 2 ) = fix'^y^)} the function ftfe'^ is a constant 
C(xi,x[). 

We showed above A Xl ^ is irreducible, which means that the function field of A x1)3 j 
is generated by y 2 ,y' 2 with y 2 being of degree deg 2 (/) over K(y 2 ). Since 

fi{x\,y 2 ) - C{x 1 ,x' 1 )fi(x' 1 ,y 2 ) = 

and the degree of fi{xi,y 2 ) — C(xi,x' 1 )fi(x' 1 ,y' 2 ) in y 2 is less than deg 2 (/), it must be 0. 
That implies that f(xi,y 2 ) = P(y 2 ) + Q(xi), which contradicts our original assumptions 
on /. So J does not vanish identically on W. □ 
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We are now almost ready to apply Lemma [221 if we can show that 7Tt 1 ^ 3 : Wt x ,t 3 — > A 4 
has no 'bad fixed locus'. That is, there is no divisor D G A 4 such that for all t\,t2 G IK, 
the map "Kt lt t 3 is either ramified over D, or is not finite over any point d G D. 

Case 1: Suppose there is some divisor D G A 4 such that for all points {y 2 , X2, y'2, x' 2 ) G D, 
TTt^tg is not finite over (1/2, X2, y'2, x' 2 ). Consider the projective closure W inside the 

s P ace P l riI ' 1>y ; x K2,x2,y' 2 ,x' 2 - We can extend 7Ttoa ma P 

7f : TF A 4 

which is now proper. Since vr <1<3 is not finite over x G A 4 2 ^ ^ x , , the preimage 

Tr^tgO?) has a point 'at infinity'. Since this is true for all tx,t2, it follows that 
7f~ 1 (if) has a 2-dimensional component at infinity. 

The variety Tt~ 1 (y 2 ,x 2 ,y' 2 ,x' 2 ) is cut out by the projectivized equations 

{f(x 1 ,y 2 ,e) = f(x' 1 ,y' 2 ,e),f(x2,yi,e) = f(x' 2 ,yi,e)} , 

where e is the homogenizing variable. Now, the component at infinity is given by 
e = 0. By our assumption on /, the defining equations of tt" 1 (y 2 , X2,y 2 , x' 2 )n{e = 0} 
become 

de gl (.f) _ >dc gl (/) dcg 2 (/) _ /deg 2 (/) 

This is a one-dimensional projective variety, which is a contradiction. 

Case 2: Suppose that there is a divisor D G A 4 such that for all ti, £3, the projection 7r <lj<3 is 
ramified over L>. That is equivalent to saying that tt^\ 3 (D) has a multiple component 
in Wtjfy- Since this is true for all t\,ts, it implies that ir (D) has a multiple 
component on W. Now, W is a direct product of Y = {f(x\, j/2) = /(^i) 2/2)} an d ^ = 
{/(x2,yi) = /(x^yi)}- There are projections maps 7ry : y — > A^ yl and 7r^ : Z — > 
x , such that it = iry x 7T£. This means that over each point x = (x2,x' 2 ,y2, y'2) in 
D, either Y^/ := {/(xi,y 2 ) = /K,2/ 2 )}, or Z XijX > % := {f(x 2 ,yi) = /0 2 >yi)} has 
a multiple component. We treat the case of Y y2y ' having a multiple component, the 
case of Z X2X i being symmetric. 

If Yy 2 ,y' 2 nas C as a multiple component, then fi(x±,y 2 ) = fiix^y^) = on C. 
But by our assumptions on / this is only a finite number of points, which is a 
contradiction. 

Applying CorollaryEU we see that for generic (ii, i 2 , £3, U), Wti,t 3 x A 4 W^,^ = Vt^fcM 
is 4-dimensional and mainly irreducible except for possibly over a proper closed subset 
Y" C A 4 . The following lemma rules out the existence of Y and so completes the proof of 
Lemma [24l 

Lemma 27. For generic ii,i2>i3>^4 ^ere is no proper Zariski-closed subset Y C A 4 ?cift 
Wti.ta X A 4 Wt 3 ,u being more than 3- dimensional over Y. 
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Proof. Note that the lemma is equivalent to the statement that for any Y C A 4 of dimen- 
sion at most 3, we have 

dim^y) + dimTr^Y) - dim(Y) < 4. 

To prove this, first observe that for any point y E A 4 the dimension of dimir^ ta (y) 
is at most 1. Moreover, since Wt lt t 3 is 4-dimensional and mainly irreducible and TTt lt t 3 is 
dominant, the locus in A 4 where the dimension of the fibers jump is at most 2-dimensional. 
Therefore, all we have to exclude is the existence of a 2-dimensional closed subvariety 
Y C A 4 such that for almost all points y E Y both dim^^ (y) and dim7r^ 1 t4 (y) are 
1-dimensional. Since we want the result for generic t\,ti,t^,t±, it suffices to exclude the 
case where a single bad variety Y exists for all ti,t%. If such a Y existed, then for almost 
all p E Y,diimr~ 1 (p) > 3. However, since by assumption the polynomial f(x,y) is monic 
in x, tt~ 1 (p) is a product of two non-degenerate curves, and so is 2-dimensional. □ 

□ 



8 Algebraic tidbits 

Often it is insufficient to know that some algebraic property holds generically, but one 
needs a bound on the degree of the exceptional set. The next lemma and its corollaries 
take care of this situation. 

Lemma 28. Suppose V C A" is a variety of degree d, and ir: A n — > A rn is the projection 
map. Let U = {x E vr(V) : 7r^ 1 (x) n V = A n ~ m }. Suppose dimU = dimV + m — n — r . 
Then U is contained in a variety of dimension dim U and degree at most d r+1 . 

Proof. Think of A™ as A m x A n_m , where ir(x,y) = x. Then for every yo £ A n ~ m 
write Vjj = V C\ {y = yo}. Note that for a generic yo the variety Vg is proper and of 
degree d. We define varieties W\ , Wi , . . . inductively. Let W\ = V$ x for some generic 
yi E A n ~ m . Suppose Wi has been defined, then either for a generic yi + i E A n ~ m the 
inequality dim( Wi n Vg i+1 ) < dim Wi holds or there is an irreducible component W of Wi 
of dimension dim TV' = dimWj contained in Vp i+1 for every choice of yi+i E A n ~ m . In 
the former case let Wj+i = Wi H Vg i+1 for a generic yi+i, and continue the sequence. In 
the latter case, the sequence stops with Wi. In that case since U = n^eA™-" 1 we have 
W' C U C Wi. Thus, dim U = dim Wi, and by Bezout's theorem deg Wi < Y\ deg V$ = d l . 
Since y\ E A n ~ m is generic, dimlYi + (n — m) < dimY. Finally, from dim Wj+i < dimVFj, 
dimlYi < diml/ + m — n and U C Wi, it follows that the sequence of Ws terminates 
after at most r + 1 elements. □ 

Corollary 29. If an irreducible polynomial f(x, y, z) of degree d is not of the form g(x, y), 
then there are at most d 2 pairs (a, b) for which f(a, b, z) is zero as a polynomial in z. 
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Proof. Let V = {f(x,y,z) = 0} and ir be the projection on (x,y). Then in notation of 
the preceding lemma, U is the set of pairs (a, 6) G A 2 for which f(a,b,z) = 0. Write 
f(x,y,z) = fi(x,y)z l . The set U is infinite if and only if all the share a common 
factor, which is contrary to the assumption on /. Thus dimU = 0, and the result follows 
from Lemma l28l □ 



Corollary 30. Let f be a polynomial of degree d inn variables, and suppose the polynomial 
f c (xi, . . . , x n -i) = f(x±, . . . , x n _i, c) has no linear factors for a generic c. Then there are 
at most d n {n — 1) values c for which f c does have a linear factor. 

Proof. Without loss of generality / depends non-trivially on each of cci, . . . , x n . Let 

C = {c : f c has a linear factor }. 

If a\X\ + • • • + a n -\x n _i + b is a factor of f c (xi, . . . , x n -i), then at least one of ctj is non- 
zero. Thus, without loss of generality there is C C C of size \C'\ > |C|/(n — 1) for which 
f c has a linear factor with non- vanishing coefficient a n -i- By rescaling, we may assume 
that for every c G C the linear factor is of the form a\x\ + • • • + a n -2X n -2 + b — x n -\. 
Define polynomial g in 2n — 2 variables by 

g(xi, ... , x n - 2 ,ai, ... , a n - 2 , b, c) = f(x 1} ... , x n _ 2 , aixi H h a n _ 2 x n -2 + b,c). 

Since / depends non-trivially on x n -i, the polynomial g depends non-trivially on b, thus 
the variety V = {g = 0} C A 2n ~ 2 is of dimension 2n — 3. Let U = {{a±, . . . , a n _2, b, c) : 
g(xi, . . . ,x n -2, ai, . . . , a n -2, b,c) = 0}. Since (oi, . . . , a n -2, b, c) G U if and only if a\x\ + 
• • • + a n -2X n -2 + b — rc n _i is a factor of f c , and f c has at least 1 linear factor, it follows 
that \C'\ < \U\. As C is finite, dimU = 0, and Lemma [28] implies \C\ < (n - 1)\U\ < 
(n - l)d n . □ 

Corollary 31. Suppose V C A" is an irreducible variety of degree d, the map ir: A n — > A rn 
is the projection, and dimV = dim7r(V). Let U = {x £ V : dim(-7r _1 (7r(a;)) n V) > 0}. 
Then U is contained in a subvariety ofV of codimension 1 and of degree at most d 3 (n—m). 

Proof. Factor ir as 7r = a n - m • • • a\, where each dj is a projection collapsing a single 
coordinate. Let 7Tj = cxj • • • o\ and 

Ui = {x G V : a^(ni(x)) C\V = A 1 }. 

Since dim7r(y) < diimriiV) < dimV, it follows iTi(V) = dimV. If dimt/j = dimV, 
then Ui = V by irreduciblity of V, which would contradict dim7rj(V) = dimV. Thus by 
Lemma [28] the degree of TVi(Ui) is at most d 2 . Since U is contained both in V and in the 
union of 7r^ 1 (7Tj(L r j))'s, the corollary follows from Bezout's theorem (Lemma 113ft . □ 
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Lemma 32. Suppose n > 3 and let f(x±,X2, »,x n ) be an irreducible polynomial of degree 
d with no linear factors over an algebraically closed field K. Then there is a coordinate Xi 
such that if we fix the value of Xi to an element c£ K, then for all but d n (n — 1) values of 
c, the resulting polynomial f(x\,X2, ■ ■ ■ , c, ccj+i . . . , x n ) also has no linear factors. 

Proof. We will in fact show a stronger result that one can take X{ to be one of xi,X2,x%. 
Since the roles of the variables are not symmetric, it is convenient to rename them 
x,y,z,wx,. . .,w n - 3 . 

Assume the conclusion of the lemma is false. Then, by Corollary [30l for all elements 
c G K, /(c, y, z, Wi, . . . , x n s) has a linear factor as a polynomial in y, z, w\, . . . , io n -3- 
Likewise for the y and z coordinates. 

Moreover, these linear factors must have coefficients that are algebraic over say 
a(x)y+ (3(x)z+^i(x)w\ + • • • +^ n ^{x)w n -^ + 5(x) . The function a(x) vanishes only \id y f 
vanishes as well. Thus if a{x) vanishes infinitely often, then d y f vanishes on a subvariety 
of {/ = 0} of dimension n, which by irreducibility of / implies that / does not depend on y. 
If / does not depend on y, then the lemma is trivially true. Thus we may assume that for 
a generic x the linear factor is of the form — y + f3(x)zA-^i{x)w\ + - ■ ■ -\-^ n - 3 {x)w n - 3 + 5{x). 
Since / is irreducible, 

f(x,y,z,w±,..., w n -z) = t(x) \\{-y + Pj{x)z + j 1 j{x)wx H h j n -3,j(.x)w n - 3 + Sj(x)) 

j 

where the product is over the conjugates of ((3(x), 7i(x), . . . , 7„_3(x), 5(x)). 
By the same reasoning applied to z instead of x, 

f(x,y,z,w 1 ,... ) w n - 3 ) = t'(z) (-y + /3j (z)x + -y' ld (z)w 1 H h j' n _ 3d {z)w n - 3 + Sj (z)) . 

j 

Thus comparing these two descriptions of the roots of / considered as a polynomial in y 
over K(x, z){w\, . . . ,w n -. 3 ), we conclude that jij = 7^ are constant, and 

p(x)z + 6(x) = f3'(z)x + 5'(z). 

Thus P, f3' , 5, 5' are linear, and / is of the form f(x, y, z, w±, . . . , w n s) = Axz + Bx + Cy + 
Dz + EiWi. Since for fixed y = c the polynomial f(x, c,z,...) has a linear factor, it 
follows that / is itself linear, a contradiction. □ 

Lemma 33. Let W be a non-linear irreducible variety of dimension m in A n , with coordi- 
nates being x±,X2, .-,x n . Then there are m + l coordinates Xi i: Xi 2 , ...,Xj m+1 such that the 
projection ofW onto their span is contained in an m- dimensional non-linear hypersurface. 

Proof. We induct on n. If n = 1 or n = 2, there is nothing to prove. Suppose n > 3, and 
consider the n coordinate hyperplanes, A™ -1 = {xi = 0}, and let Wi be the projection of 
W onto A™ -1 . Suppose there an i such that dim Wi = m— 1. Then W = Wi x Aj because 
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W is irreducible. Thus Wi is non-linear, and by induction there is a projection of Wi onto 
the span of {xj}j e s- Then projection of W onto the span of {xj}j£Su{i} 1S contained in a 
non-linear hypersurface. So, we may assume all the Wi are of dimension m. 

Introduce a vector space structure on A n in such a way that G W. Let L = spanW 
be the vector space spanned by W, and write Li for the projection of L onto A" -1 . Since 
W is non-linear, dimL > dimVF + 1 = m + 1. If dimLj = m, then i'th basis vector ej 
is in L. If dimLj = m for all i, then A n = spanjei, . . . , e n } C L, implying n = m + 1, 
in which case there is nothing to prove. Thus, we can assume there is an i such that 
dimLj = m + 1. But then Wi is nondinear, and the results follows from the induction 
hypothesis. □ 

Lemma 34. Let f(x) be a non-constant rational function in ¥ q (x) of degree at most 
q — l- Suppose also there are non-zero rational functions P(s,t) and g(s,t) and constants 
a,b G ¥ q such that P(g(x,y),af(x) + bf(y)) = 0. Then there exists a rational function 
G(x), such that g(x,y) is one of 

G(x),G(y), orG(af(x) + bf(y)). 

Proof. Let IK be the subfield of ¥ q (x, y) consisting of all elements algebraic over ¥ g (af(x) + 
bf(y)). Note that g(x, y) G K by assumption. Since / is non-constant, K has transcendence 
degree 1 over ¥ q . We claim that K is isomorphic as a field to ¥ q (t). To see this, first note 
that IK is finitely generated, since its a subfield of a finitely generated field. So K is the 
function field of a smooth, non-singular curve C over ¥ q . Also, the embedding K C ¥ q (x, y) 
corresponds to a dominant rational map from A| to C. But if C was not birational to 
A 1 , then this map would have to be constant on every line, since a curve cannot map 
non-trivially to a curve of higher genus. But this contradicts that the map is dominant. 
So K is indeed generated by a single element. 

If a or b are 0, then IK is generated by one of y or x, and we are done. Suppose then 
that neither a nor b is 0. Then it remains to prove that K is generated by af{x) + bf{y) 
over ¥ q , or equivalently that af(x) + bf(y) is a non-composite rational function. That is, 
there are no rational functions Q(t) £ ¥ q (t), and r(x,y) £ ¥ q (x,y) such that Q(r(x,y)) = 
af(x) + bf(y) and deg(Q) > 1. Suppose for the sake of contradiction this is the case. 

Since Q{t) is a rational function of degree at least 2, Q must be ramified over at least 
one finite point, say over c S ¥ q . This means that Q{t) — c has a double root at some point 
c' G ¥ q , so that Q(r(x, y)) — c = has a multiple component of the form (r(x, y) — c') 2 , and 
af(x) + bf(y) — c = must also have a multiple component. But by the Jacobian criterion, 
af(x) + bf(y) — c = is only singular at points (xo,yo) sucn that f'(xo) = f'(yo) = 0. 
There are only finitely many of these points, so af(x) + bf{y) — c = cannot have multiple 
components. This contradiction finishes the proof. □ 

Lemma 35. Let p: V — > W be a dominant, equidimensional map such that W is irre- 
ducible of dimension m, and for a generic point w G W , p~ l (w) is irreducible of dimension 
n. Then V has a unique irreducible component of dimension m + n. 
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Proof. That the dimension of V is m + n follows from dimension theory, so assume V 
has two disjoint components of dimension m + n, V\ and V2 with V\ U V2 = V . For an 
open set U C W we know that whenever u £ U, p~ l (u) is irreducible. So p~ l {u) lies in 
either Vi or in Vi- This means that p{V\) CipiVi) is of dimension less than m. However, 
since the fibers of p are of dimension n, dimension theory says that each of p{Vi),p(V2) 
are of dimension at least m, and hence exactly m. But then p(Vi),p(V2) are two distinct 
components of W, contradicting the irreducibility of W. □ 

Lemma 36. Let f : X — >■ Y be an equidimensional map with X, Y irreducible, dim(y) = 
m, dim(X) = n. Then f~ l {y) is generically mainly irreducible of dimension n — m iff 
X Xy X is mainly irreducible of dimension 2n — m. 

Proof. Let L = F g (X),K = ¥ q (Y). Since having at least two maximal reducible 

components is a Zariski-closed condition on Y, looking over the generic point we see that 
the theorem is equivalent to the following statement about fields: 

L <S>k L is a domain <?=^ L ®k IK is a field. 

Call the above statements (i) and (ii) respectively, and consider the following additional 
statement: (iii) IK is algebraically closed in L. We will show that both conditions are 
equivalent to (iii). 

(ii) =^> (i) . To prove L <S>k L is a domain, it is enough to show that L ®k L <8>k K is a 
domain, and the latter is 

L <g> K L ® K E ^ (L ® K K) ®^ (L ® K K) 

which is a domain, since the product of geometrically irreducible varieties is irreducible. 

(i)^=^> (iii). Suppose (iii) fails to hold so that L contains a finite algebraic extension 
M of IK such that M/K. Then L<8>k1K contains a copy of M®iM which is not a domain, 
so that (i) fails to hold as well. 

(iii) =^- (ii). Suppose not, so that IK is algebraically closed in L, but L S3k IK is 
not a field. Since IK is a union of finite extensions of IK, there must exist some finite 
algebraic extension M of IK such that L ®k M is not a field either. We can present M as 
M = K[x]/(P(x)) for some irreducible polynomial P(x), so that L <g> K M = h[x]/(P(x)). 

Since L,[x]/(P(x)) is not a field, P{x) must factor as P(x) = Q(x)R(x), where Q,R 
are polynomials with coefficients in L. But the coefficients of Q and R can be expressed 
as polynomials in the roots of P, and are therefore algebraic over IK. This contradicts the 
fact that IK is algebraically closed in L. □ 

9 Problems and remarks 

• We expect Theorem [6] to hold without the condition that / is monic. In fact, the 
proof presented above holds provided the irreducibility of V and Vt t ,t2,tz,u can De 
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established, and it is there that the condition that / is monic is invoked. Whereas 
the monicity condition on / can be relaxed, it cannot be removed completely in view 
of the following counterexample due to Vivek Shende: 

Example. Let p(x), and g(x,y) be two non-linear polynomials, such that g(x,y) 
depends non-trivially on y, and p(x) is not a square. Then for 

f(x,y) = p(x)g(x,y) 2 , 

the variety V in the proof of Theorem is reducible. 

Nonetheless, it seems likely that sum-product phenomenon should persist in the 
absence of any 'group-like structure'. More precisely, we believe in the following 
conjecture: 

Definition. Let (G, +) be a one-dimensional abelian algebraic group (i.e. G is either 
G m , G«, or an elliptic curve), and define Go to be 

Go = {(51,52,5s) e G 3 I 51+52 + 53 = 0} . 

An irreducible surface V C A 3 is said to be group-like with respect to G if there 
are irreducible curves C\ , C2 , G3 , and an irreducible surface W C C\ x C2 x G3 , and 
rational maps /j : C{ — > A 1 , gi : C% — > G such that the Zariski closure of (/1 x fa x 
fo)(W) is V and the Zariski closure of (g\ x g 2 x 53) (W) is Go- 

Conjecture. There exists an absolute constant 5 > such that whenever f £ 
¥ p [x,y,z] is an irreducible polynomial that depends non-trivially on all three vari- 
ables, and A a subset of¥ p , then either 

N(f;A,A,A)< d m a x(\A\ 2 ~ s ,\A\ 2 (\A\/p) 5 ) 

or the surface {/ = 0} is group-like with respect to some one- dimensional abelian 
algebraic group. 

Intuitively, the conjecture states that the only examples of surface {/ = 0} containing 
many points on the Cartesian products are of the form 

f(x,y,z) = F(G x (x)®G y (y)®G z (z)) 

for some group operation ® and algebraic functions F, G x ,G y ,G z . 

• The proof of degf = 2 case of Theorem Q] can be modified to show that \A + A\ + 
\f(A) + B\ 3> \A\ ji?) 1 / 1000 for every quadratic polynomial / whenever \A\, \B\ < ^fp. 
The modification requires a sum-product estimate on \A + A\ + \A -q B\ where 
G C A x B is a dense bipartite graph. Such an estimate can be established by 
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a simple modification of the proof in [Gar08a] of the case G = A x B. However, 
as the resulting proof is long and lacks novelty, it is omitted from this paper, but 
can be found at http: //www.borisbukh. org/sumproductpoly_quadratic .pdf It 
remains an interesting problem to show that \A + A\ + \f(A) + B\ S> |^4||i?| e for 
some e = e(deg / ) > for polynomials of any degree. 
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